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ABSTRACT 

Using the cosmological smoothed particle hydrodynamical code GADGET-3 we make a realistic 
assessment of the technique of using constant cumulative number density as a tracer of galaxy evolution 
at high redshift. We find that over a redshift range of 3 < z < 7 one can on average track the growth 
of the stellar mass of a population of galaxies selected from the same cumulative number density bin 
to within ~ 0.20 dex. Over the stellar mass range we probe (10 10 39 < M s /Mq < 10 10 ' 75 at z = 

3 and 10 848 < M s /Mq < 10 955 at 0 = 7) one can reduce this bias by selecting galaxies based on 
an evolving cumulative number density. We find the cumulative number density evolution exhibits a 
trend towards higher values which can be quantified by simple linear formulations going as —O.IOA 0 
for descendants and O.12A0 for progenitors. Utilizing such an evolving cumulative number density 
increases the accuracy of descendant/progenitor tracking by a factor of ~ 2. This result is in excellent 
agreement, within 0.10 dex, with abundance matching results over the same redshift range. However, 
we find that our more realistic cosmological hydrodynamic simulations produce a much larger scatter 
in descendant/progenitor stellar masses than previous studies, particularly when tracking progenitors. 
This large scatter makes the application of either the constant cumulative number density or evolving 
cumulative number density technique limited to average stellar masses of populations only, as the 
diverse mass assembly histories caused by stochastic physical processes such as gas accretion, mergers, 
and star formation of individual galaxies will lead to a larger scatter in other physical properties such 
as metallicity and star-formation rate. 

Subject headings: cosmology: theory — stars: formation — galaxies: evolution - galaxies: formation 
- methods: numerical 


1. INTRODUCTION 

Understanding how galaxies evolved from minute per¬ 
turbations in the distant Universe into the diverse zoo of 
shapes and sizes we see today is one of the fundamen¬ 
tal goals of modern astronomy. The current frontier lies 
at the edge of the observable universe, ~ 500 Myr after 
the Big Bang, and is primarily possible using the Wide 
Field Camera 3 (WFC3) instrument aboard the Hub¬ 
ble Space Telescope ( HST ). Using the Lyman-break tech¬ 
nique and/or photometric redshifts to select candidate 
galaxies, progr ams such as CAND ELS (Pis Faber & Fer¬ 
guson), BoRG (Trenti et al. |20m>, and HU DF09/UDF12 


ulation for study. 

From these surveys, fundamental properties such as 
stellar mass, age and star formation rate (SFR) can be 
derived by comparing the spectral energy distributions 
(SEDs) of galaxies to stellar population models. By se¬ 
lecting galaxies at different epochs (“snapshots” in time), 
we can in principle directly observe how galaxies evolve. 
However, tracing galaxies from one epoch to another has 
proven challenging, and frequently accompanied by mis¬ 
interpretation. Previous studies have matched galaxies 
at different epochs by comparing samples selected to have 
similai^_physical tr acers, such as at fixed UV luminosity 


jBouwens et ah||201ll lEiiis et al.lfOT, are able to ^ l Stark et al.| [2009|) This can be problematic as 

. ----■--- L- --—- —. u l □ 1 . F txtya n o I ovioc /on to of y— on r\ 1 no nthor of y—X Tinf n 


i dentify galaxies feT t o restUrame SjVmagnitudes of two .galaxies, one at 0=6 and the other at 0=3, with 

- - --- I _ ■ ' _ mini fir I T \ / liminocnliot' onn i-l nmro rlvivmofiori Kr rfvH-Av 


M 11V - 

— 17.5 (e.g., Finkelstein et al.12012 ?; Trenti et al. 

2011 

Bouwens et al. 

2012 

Ellis et al. 

2013 

). Two of 


these galaxies have been spectroscopically confirmed to 
be the earliest known galaxies to date with redshifts of 
g=7.5 1 (Finkelstein et al.|2013) and 0=7.73 (Oesch et al. 
20151. With the help of the gravitational lensing effect 
of massive f oreground galaxy clus ters, campaigns such 
as CLASH (Bouwens et al. 2014) and the HST Fron¬ 
tier Fields will extend our understanding even further to 
0 > 9 and UV magnitudes as faint as M uv ~ —13, greatly 
increasing the dynamic range of the observed galaxy pop- 
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similar UV luminosities could have dramatically differ¬ 
ent mass assembly histories depending on their individ¬ 
ual star formation histories (SFHs), environments and/or 
merger histories. 

A novel appr oach to this problem was suggested by |van| 
Dokkum et al. (2010) who tracked a population of galax¬ 
ies through cosmic time ( 0=2 to 0 = 0 . 1 ) selected to have 
a constant number density, using the observed cumula¬ 
tive galaxy stellar mass function (CSMF). The critical 
assumption this approach makes is that galaxies in the 
same number density bin will grow at a similar, smooth 
rate with a conserved rank order. Monte Carlo simu¬ 
lations were utilized to test the effects of mergers and 
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starbursts, and it was concluded that only 1:1 mass-ratio 
mergers were able to break the rank order of galaxies. 
The use of this technique was tested numerically by 


Leja et al. (2013) u sing the Millennium d ark matter 
(DM) halo cata log (ISpringel et al.| |2005|) 
analytic models (Guo et al.||2Ull). They torn 


and semi- 
found that the 
actual mass growth tracked the mass growth inferred at 
a constant number density to within 40% over a redshift 
range of z=3 to z= 0. They note, however, that they did 
not reproduce the observed mass evolution of galaxies. 
This was attributed to the fact that the sub-grid physics 
(star formation, feedback, mergers) in the semi-analytic 
models utilized for their project might not be accurately 
c apturing the physics o f the observed galaxies. 


Papovich et al. (2011) extended this technique to track 
the growth of observed galaxies at much higher redshift, 
from z=8 to z=3, concluding that galaxies exhibit a SFH 


tions of decaying SFHs, see also 

Reddy et al. 

(2012): 

Fin- 

lator et al. 

(2011); 

Jaacks et al. 

(2012)). This result has 


erties of observed galaxies. To justify the use of this 
method at high-redshift, the authors used the DM halo 
catalogs from the Millennium simulations to track halo 
growth, making the assumption that halo growth directly 
tracks galaxy growth (i.e., n ga i(> L uv ) « n ga i(> M s ) ~ 
n ga i(> M v i r )). While they found that halo mass growth 
at a constant number density of n = 2 x 10 -4 Mpc -3 was 
indeed smooth over the same time period, there was no 
consideration for the physics of formation and evolution 
of the galaxies themselves. 

A more recent numeric al test at high-z was performed 
by B ehroozi et a l. (2013a) using the Bolshoi DM halo cat¬ 
alog ( Klypin et al.|2Ull ) along with the merger tree and 
an abundance matching technique which assigns galaxies 
to halos based on their relative number densities. This 
study found that when comparing stellar mass evolution 
across redshift one must account for the fact the the num¬ 
ber density of a given population is not constant to avoid 
mass est imate errors of ~ 0.3 d ex. While the technique 
used by Behroozi et al. (2013a) did extend to tracking 
descendants of z=ti galaxies, it did not explicitly include 
the details of fundamental physics involved in galaxy for¬ 
mation and evolution. 

A contemporan eous work by Torrey et al. (|2015 ) uti¬ 
lized the Illustris (Vogelsb erger et al.|2L)14 iTiyarodynam- 
ical simulations to test the constant number density evo¬ 
lution tracking method at 0 < z < 3, and found that 
galaxies do not evolve at constant number densities due 
to the effects of merger events and scatter in individual 
galaxies growth rates. We will show that these results 
are very consistent to this work even though focused on 
different redshift epochs. 

In this work we will use cosmological smoothed par¬ 
ticle hydrodynamical code GADGET-3, where galaxies 
form in-situ, to test the technique of using constant cu¬ 
mulative number density as a tracer for tracking galaxy 
evolution at high-redshift (i.e. 3 < z < 7). Motivated by 
recent tests of this technique which rely the combination 
of dark-matter-only simulations with either abundance 
matching or semi-analytic models, we intend to capture 
the realistic mass assembly histories of galaxies in simula¬ 
tions with sophisticated baryonic physics. This work will 


TABLE 1 

Simulation Parameters 


Box size 

N p 

m D M 

T^gas 

e 

(/i 1 Mpc) 

(DM, Gas) 

(V'Me) 


(/i 1 kpc) 

100.0 

2x800 3 

1.17x10 s 

2.38X10 7 

5.00 


Simulation parameters used in this paper. The parameter N p is 
the number of gas and dark matter particles; ranM and m gaa are 
the particle masses of dark matter and gas; e is the comoving 
gravitational softening length. 


be organized as follows: In Section [2] we will detail the 
simulations used in this exercise; Section [3] discusses the 
methods used for identifying and tracking galaxies in our 
simulations; Section H presents our results; In Section [5] 
we present additional discussion, and we summarize in 
Section [6j 


2. SIMULATIONS 


We use a modified version of the smoothed particle 
hydrodyna mics (SPH) cod e GADGET-3 (originally de¬ 
scribed in Springel 2005). Our code includes radia¬ 


tive^ coohngbyTTTTieTlrnd metals (Choi & Nagamine 


Faucher-Giguere et al 

2009 

), the UVB self-shielding ef- 

feet (Nagamine et al. 

2010). the initial power spectrum 

of 

Eisenstein & Hu (1 

999), 

supernovae feedback, a sub- 


20031, Multicomponent Variable Velocity (MW) wind 


model (Choi fc Nagamine 


2011) and H 2 regulated star 


formation (Thompson et aT 2014a|). Our current simu¬ 
lations do not include AGIN feedback which is expected 
to contribute significantly a t z < 3 in massive galax ies. 


See Thompson et al. (2014a) and Jaacks et al. 
detailed comparisons of our simulations to bot 


(2013) for 
r high and 


low redshift observations. In particular we find excellent 
agreement with both the observed z=4-7 UV luminosity 
functions in both faint-end slope (a) and characteristi c 
brightness ( My V ) found in ? and Bouwens et al. (2015|), 
as well as the number densities (d>) from the observed 
stellar mass functions in Song et al. (120151 over the mass 
range studied here. 

Our simulation is set up with 800 3 particles for both 
gas and dark matter in a comoving box with sides of 
100ft. -1 Mpc. We will refer to this run as N800L100 and 
complete simulation para meters are summarized in Ta- 
blejl] We use the 2LPTic (Scoccimarro 19981 code which 
utilizes second-order Lagrangian perturbation theory for 
generating our initial conditions at z=99. The adopted 
cosmo logical parameters are consistent with the WMAP7 
data (|Komatsu et al.||2009|: D m = 0.26, = 0.74, 

D b = 01043, ft = 0772, er 8 ~^0.80, n s = 0.96. 

Snapshots were recorded every 20 Myr starting at z=10 
allowing us to more accurately track the growth of each 
galaxy, resulting in 72 snapshots from z — 7 to 3. Galaxies 
were identified in each snapshot usi ng a variant of the 
SUBFIND algorithm ( Springel et al.|2001 ) which groups 
particles based on baryonic density peaks. A minimum 
of 32 particles (gas+stars) is required to be considered 
for grouping. However we require a minimum of 100 
baryonic particles and at least 20 star particles for the 
galaxies to be considered within our mass resolution. 

It should be noted that our simulations assumed a 


Chabrier initial mass function (IMF) (Chabrier 2003) 
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TABLE 2 

Identification, mass and total number of galaxies for each number density bin used in this study 


ID 

n (center) 
[Mpc -3 ] 

logn 

logM s at z =7 
[M@] 

log M s at z—3 
[Mq] 

total # 

nl 

2.00 x 10 -4 

-3.70 ±0.20 

o 7n +0.22 

' U -0.23 

10.38i°;i° 0 

538 

n2 

8.10 x 10 -5 

-4.10 ±0.20 

9-io±g;l| 

10-54t°;“ 

201 

n3 

2.87 x 10 -5 

-4.50 ±0.20 

9.40l°j| 

10.69±°;gl 

75 



Fig. 1.— Cumulative stellar mass function for z = 3, 4, 5, 6 and 7 
(solid blue lines, top to bottom). The horizontal solid, dashed and 
dashed-dot lines represent constant number density cuts nl, n2, 
n3 (see Table [ 2 ] for details), respectively. For each number den¬ 
sity studied, the total bin width is shown by the horizontal orange, 
green and blue shaded areas. The gray shaded region denotes the 
approximate mass resolution for our simulation run, which corre¬ 
sponds to rsj 100 baryon particles. The mass value associated with 
the intersection of the CSMF and the constant number density cuts 
will be referred to as the inferred mass growth. 


and when appropriate, data su ch as stellar ma sses which 
assumed a different IMF (e.g., Salpeter (1955)) are con¬ 
verted for consistency using M T mf = Msainster/ Itmf 


where f tmf = 1-6 fo r Chabrier. (Nagamine et al. 2006 
Raue fc Meyer|[2012 ). 


3. METHODS 
3.1. Galaxy selection 

Galaxies are selected based on symmetric number den¬ 
sity bins as determined bv the cumulative stellar mass 
function (CSMF, Figure [l| from our simulation at z= 7. 
For our fiducial number density bin to select galaxies, 
we use n = 2.0 x 10 -4 [Mpc -3 ], a choice w hich is mo¬ 


tivated by i ts use in previous studies (e.g.,. _ 

et al.||20ib| |Papovich et al.||2011| |Leja et al'.||2013[). 1 
number density can then be translated via the CSMF 


van Dokkum 
This 

density can then be translated via the CSMF to 
a corresponding stellar mass of M s « 10 8 7 Mq at z=7. 
We also explore lower number densities of n = 8.10x 10 -5 
[Mpc -3 ] and n = 2.87 x 10 -5 [Mpc -3 ] which have cor¬ 
responding x = 7 masses of M s « 1O 91 M 0 and M s ss 
10 9 4 Mq, respectively. Throughout this work these num¬ 
ber density bins will be referred to as nl, n2, and n3, re¬ 


spectively. Full details of each number density bin stud¬ 
ied can be found in Table [2] The choices of the values 
for n2 and n3 are motived by covering a wide range of 
masses within our simulation mass resolution, with little 
to no overlap between populations. We do not explore 
higher number density (lower mass) values due to the 
restrictions set by our simulation’s resolution (see gray 
shading in Figure]!]). We also do not explore values lower 
than n3, as such abin would have < 50 tracked galaxies, 
leading to poor number statistics. 

It should be noted that the symmetric number density 
bin leads to a slightly asymmetric mass bin (i.e. more 
low mass galaxies), depending on the shape of the CSMF 
at the selection redshift. However this choice allows for a 
constant number of galaxies at each redshift for a given 
number density, whereas a symmetric mass bin would 
lead to varying galaxy counts due to the changing shape 
of the CSMF. 


3.2. Galaxy tracking 

The tracking of single galaxies across multiple snap¬ 
shots is achieved through particle ID matching and the 
“most massive” descendant/progenitor algorithm. The 
galaxy at each subsequent snapshot which contains the 
highest number of matching particles from the original 
galaxy is considered to be its descendant/progenitor. 

One downside to the “most massive” descen¬ 
dant/progenitor algorithm is that there are scenarios 
when the procedure is not completely reversible. For ex¬ 
ample, in merger events the progenitor path will choose 
the most massive of the two galaxies whereas the de¬ 
scendant path may have originated via the lower mass 
galaxy. To test the impact of this effect on our results 
we conducted a test with a random sample population of 
galaxies selected at z=7. This population was tracked to 
z =3 and the final GIDs were used as a starting point for 
a progenitor track back to z= 7. We found that ~ 10% 
of the progenitor population differed from the original 
z=7 population. However, this difference only led to a 
median mass differential of ~ 0.02 dex. Therefore, we 
conclude that the partial non-reversibility of the “most 
massive” algorithm utilized in this work has a negligible 
impact on our results. 

While the algorithm for tracking galaxies in the simu¬ 
lation is simple, in practice this is a non-trivial exercise. 
All group-finders introduce some amount of uncertainty 
when associating particles with a galaxy, especially dur¬ 
ing merger events. Particles which were included in one 
snapshot can be excluded in the subsequent one, primar¬ 
ily those which are located more near the edges of the 
identified system. This uncertainty manifests itself as 
“noise” in the mass growth of each galaxy (see jagged 
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Fig. 2.— Actual mass growth for descendants, with individual galaxies shown as the gray lines, and medians in redshift bins shown by the 
orange/green/blue circles/squares/triangles, compared to the inferred mass growth by tracking a constant number density from the CSMF 
(solid orange/green/blue lines). The shad ing represents the sta ndard deviation of the stellar mass at a given redshift of the individual 
simulated galaxies. Observed results from |Papovich et al. | (|2011) (cyan square) and |Salmon et al.| ((2015) (yellow diamonds) are show for 
comparison. These observations were obtained trom ttie same number density as our nl bm. While the inferred mass growth does appear 
to track the average mass growth of the individual simulated galaxies, there is both a bias of 0.2 dex and a substantial scatter of d=0.30 dex 
by z = 3. 
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Fig. 3.— Actual mass growth for progenitors, compared to the 
inferred mass growth from the CSMF. All symbols, lines and shad¬ 
ing represent similar quantities as in Figure [2] though here for 
progenitors of z = 3 galaxies. While again this technique appears 
to work on average, both the bias and the scatter of the constant 
number density tracking technique are larger when tracking pro¬ 
genitors than when tracking decendants. 


gray lines in Figure. [2]). While this “noise” is aesthet¬ 
ically unpleasing, it does not affect the overall median 
growth trend. 


3.3. Actual versus inferred mass growth 

Throughout this work we will refer to the actual mass 
growth and the inferred mass growth of the studied 
galaxy population. The inferred growth refers to the 
mass assembly of the ensemble galaxy population as de¬ 
rived from the CSMF by taking the mass associated with 
a constant number density cut at each redshift (e.g. the 
orange line intersection with CSMF in Figure IT]) . The 
actual mass growth is the mass assembly unique to each 
individual galaxy (gray lines in Fig ure. [2| determined at 
each snapshot as described in Sec. 3.2 In each case the 


actual mass growth will be represented by the median 
of the distribution with the ler scatter represented by 
shading (e.g. orange circles, orange shade Figure. [2]). 


4. RESULTS 
4.1. Descendants 

In Figure [2] we compare the inferred mass growth to 
the median of the actual mass growth for each galaxy in 
the nl, n2, and n3 bins. This comparison illustrates that 
the inferred growth tracks the actual mass growth on av¬ 
erage, though there appears a systematic bias where the 
inferred mass growth is ~ 0.20 dex higher (within a fac¬ 
tor of < 2 in mass) than the actual mass growth. This 
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Fig. 4.— Redshift evolution of the number density for each population (nl, n2, and n3) for both descendants (left) and progenitors 
(right), with the shaded regions denoting the la scatter. For the progenitor plot (right), galaxies that form after z =7 are dropped at their 
formation redshift as we approach higher redshifts and therefore do not contribute to the scatter prior to thei r for mation ( i.e., the total 
number of galaxies is reduced, and the average mass does not include objects with zero stellar mass; see Section |4.5| for more details). This 
makes the scatter information less reliable at higher redshifts for progenitors. We provide linear least square fits (solid line s') to t he median 
evolution for each population (solid orange, green, blue lines) which can be directly compared with the results from jllehroozi et al.1 1 | 20 1 3njl , 
shown as the dashed lines, showing excellent agreement, in spite of the two very different techniques between our two studies. 


demonstrates that, on average, using constant cumula¬ 
tive number density to track the descendants of a pop¬ 
ulation of galaxies through cosmic time can be an effec¬ 
tive method of measuring stellar mass growth. This re¬ 
sult is consistent w ith st ud ies using t he same techniq ues 


at lower redshifts (van Dokkum et al. 


201 .‘> [) and at high-xi ( fl J apovich et al.|2UI l[|Befiroozi et al. 


2010} Leja et al. 


2013a). We caution however that this is an on average 
result which has a large scatter (~ ± 0.30 dex) which we 
will discuss in more detail in Section |4.5| and a system¬ 
atic offset which suggest that ev oluti on of the population 
must be considered (see Section 4.3). 

In Figure [2] we also compare to stellar m ass growt h es¬ 
timate s obt ained through ob s ervatio ns by | Salmon et al. 
( 2015]) and |Papovich et al. (2011). This comparison 
demonstrates that our simulations, on average, are repro¬ 
ducing the observed growth of g alaxies sele cted from a 
bin similar to our nl. As Salmon et al. (2015) results were 
obtained using the evolving cumulative number density 
technique, it also serves as a good check that our sim¬ 
ulations are reproducing the observed galaxy properties 
and evolution well. 


populations in the nl, n2, and n3 bins. As with the de¬ 
scendants, the constant cumulative number density tech¬ 
nique, on average , reproduces the mass assembly history 
of the population of galaxies when compared to the in¬ 
ferred assembly, although with a larger bias of 0.30 dex 
(factor ~ 2 in mass). 

However, when using this technique to track progeni¬ 
tors the scatter in the individual galaxy history increases 
substantially to ~ ±0.70 dex at z =7 (Figure [3]), and is 
much higher than was seen when tracking descendants 
(~ 0.30 dex). Further, we find that ~ 30% of the galax¬ 
ies in the original number density bin at z =3 do not 
exist at z= 7, forming at various points between z=2> and 
z= 7. This population of “late-forming” galaxies substan¬ 
tially increases the scatter in the actual assembly history 
and thus increases the offset between the actual and in¬ 
ferred mass growth. It should be noted that we drop 
the late-forming galaxies from the calculation of average 
mass growth prior to their formation redshift to avoid in¬ 
cluding objects with zero stellar mass. We include more 
discussion regarding the im pact of this “late-forming” 
galaxy population in Section 5.2 


4.2. Progenitors 

We also use our simulations to test the effectiveness 
of the constant cumulative number density technique to 
track the progenitors of a population of galaxies selected 
at z=3. For the purposes of this study we are defining 
a galaxy progenitor to be it’s most massive progenitor. 
We acknowledge that in a hierarchical galaxy formation 
scenario each galaxy is the aggregate of many progeni¬ 
tors of varied masses. However, from an observational 
perspective, it is impossible to link multiple progenitors 
to a single descendant. Therefore we feel that our defi¬ 
nition more closely approximates what can be observed. 
Figure [3] shows the mass assembly history of z = 3 galaxy 


4.3. Evolving number density 

The systematic offset between the actual and inferred 
mass growth suggests that the number density of the 
original population is evolving with redshift. This evolu¬ 
tion can be seen in Figure [4] which shows the evolution 
of the number density as a function of redshift for both 
descendants (left) and progenitors (right), for each of our 
initial number density bins. The number density for each 
individual galaxy is obtained at each snapshot by relating 
its stellar mass to a number density via the CSMF. All 
populations show a trend of increasing number density as 
a function of cosmic time (with decreasing redshift for de¬ 
scendants, and with increasing redshift for progenitors). 
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This demonstrates that the average actual mass growth 
of galaxies from each number density bin are growing 
slower than would be inferred from the CSMF (see off¬ 
set between inferred and actual stellar mass growth in 
Figures [2] & [3]). 

This trencT i s highl y consistent with findings from 
(2013a) who utilized the Bolshoi (Klypin 


Behroozi et al. 


et al. 2011) dark ma tter simulations and abundance 
matching (see Section 5.4 for further details). In Fig¬ 
ure p] we compare our linear best fit (solid li nes) to the 
results from the public code provided by Behroozi et al. 
(2013a I; dashed lines. Our work shows excellent agree- 
rnent (wit hin ~ 0.10 dex) with the evolution generated 


by the Behroozi et al. (2013a!) code over the studied red- 


shift range. 

Thus, in order to track galaxies more accurately 
through cosmic time one must utilize an evolving cumu¬ 
lative number density. By taking the average of the least 
square fit to the evolutionary trend for each population 
(see least square fits in Figure of descendants of z=7 
galaxies (10 848 < M s /M 0 < 1(P 55 ) we find, 


log nf = log rii — O.lOAz 


(1) 


Similarly the number density of z =3 (10 10 39 < 

M s /Mq < 10 10 - 75 ) progenitors can be traced utilizing, 


log n/ = logni + 0.12A^ 


( 2 ) 


We caution that these relations are only valid in the 
mass range and red shift range studi e d here (3 < z < 7). 
As pointed out in Behroozi et al. (2013a|, the evolu¬ 
tion changes at z < 3, therefore equations [l] & [2] should 
not be extrapolated to lower redshifts. Note that the 
mass ranges referenced above are derived from the over¬ 
all number density range (nl, n2, n3) at either z =7 for 
descendants or z =3 for progenitors (see Table [2]). 

4.4. Completeness 

One method which has been used to quantify how well 
the constant cumulative number density method is track¬ 
ing the evolution of the galaxy population is to examine 
the completeness fraction at each redshift, keeping the 
bin width constant. Here we define the completeness 
fraction to be the fraction of tracked simulated galax¬ 
ies which fall within the desired number density bin at 
a given redshift. In Figure [5] we present the results of 
this study, where we find that only ~ 13% of z=7 de¬ 
scendants fall within the original number density bins at 
z = 3. For clarity, we only show results for the nl bin, 
however, the results for the n2 and n3 bins are quan¬ 
titatively similar. We do not present the completeness 
fraction for progenitors for reasons which we discuss in 
Section 14.51 

We also compare our results to those found in) 


et al. (2011). To make this comparison we need to mod- 
ify our definition of completeness to match that used in 
Papovich et al. They define completeness as the frac¬ 
tion of galaxies with a number density of at most the 
value of the selection bin (n(> M s )). Our direct compar¬ 
ison to this definition can be seen by the dashed orange 
line. This definition results in a co m pleteness f ractio n of 
~ 35%, much lower than the Papovich et al. ( 2011|) re- 
sult of ~ 55%. To obtain a similar result to the Papovich 



Fig. 5. — Completeness fraction, defined to be the fraction of 
tracked galaxies at each redshift which fall within the desired num¬ 
ber density bin, keeping the bin width constant. The solid orange 
line represe nts the completeness fr action for the nl bin. We also 
compare to |Papovich et al.| (|2011 [) (gray circles), who define the 
completeness as tlie traction ot galaxies with a number density less 
than or equal to the value of the selection bin. Our direct compari¬ 
son to the published data is the dashed orange line. The cyan solid 
and dashed lines represent the completeness fraction after applying 
the evolving cumulative number density, both using our definition 
of completeness, and that of Papovich et al. We find that even 
after the correction for the evolving number density, only 23% of 
the 2 : =7 population is recovered at z =3. 


et al. (2011) study we would need to expand our target 


Papovich et al. 

(2011) 

leteness traction 

nearly 


mass bin at z =3 to ±0.29 dex. 

Repeating both of the completeness calculations above, 
now applying the evolving cumulative number density 
leads to a completeness fraction of ~ 23% as defined 
in this work and ~ 58% for the 
definition. This ma kes o ur com pl 

identical to the [Papovich et al. (2011) result using their 
less restrictive definition. However, we see a modest in¬ 
crease of only 10% at z =3 after applying the evolving 
cumulative number density when using our completeness 
definition. This discrepancy is likely due to the larger 
scatter between halo mass, galaxy mass and luminosity 
found in our simulations. The study by |Papovich et al.| 
(20111 did not include any of these sources of scatter in 
their 3stimates. 

The results from the completeness fraction study are 
somewhat misleading as the width of the z =3 mass bin 
associated with the selected number density is smaller 
than the original z=7 mass bin by a factor of ~ 2 (see 
Table [2|. The reduced mass bin width is the result of the 
steepening of the CSMF over the redshift range studied, 
and it is the primary reason why we only see a modest 
completeness fraction increase of 10% at z =3 when ap¬ 
plying the evolving cumulative number density. If the 
mass bin width is fixed at the z =7 value of ~ ±0.20 dex 
(see Table [2|), the completeness fraction at 2 =3 rises to 
- 55%. 


4.5. Scatter 

As demonstrated by the low completeness fraction, the 
majority of galaxies selected at z=7 are not falling within 
either the evolved or constant number density target bin 
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log M s [Mo] 



log M s [Mq] Number 



Fig. 6.— CSMF for 2=7-3 simulated galaxies (blue solid lines, left to right) shown with the evolution of the median number density 
of the descendants of z = 7 galaxes in the nl bin (orange squares). The orange horizontal hash lines represent the la scatter around the 
median at each redshift. The histograms illustrate the distribution of the simulated descendants in both number density (orange) and 
stellar mass (blue) at the terminal redshift of z = 3. While the distribution is wide there does exist a clear peak in both number density 
and mass. 


log M s [Mg ] 



log M s [M 0 ] Number 


Fig. 7.— The same lines and symbols as Figure [6] only here 
for progenitors of z = 3 galaxies tracked to z = 7. The red 
histogram bar in the progenitor panel represents the population 
“late-forming” galaxies found in the progenitor study. Progenitors 
of 2 =3 galaxies have a much wider distribution in both mass and 
number density that seen from descendants of z =7 (see Figure^. 


at z= 3. It is therefore useful to constrain where the 
majority of these descendants fall on the CSMF. Fig¬ 
ures [h] and [7] shows the median number density evolu¬ 
tion of simulated galaxies in our nl bin plotted with 
the CSMF at each redshift for descendants and progen¬ 
itors, respectively. These figures synthesize much of the 
data provided in this work as it also includes informa¬ 
tion regarding the mass and number density distribution 
(blue/orange histograms) for both descendants and pro¬ 
genitors. 

a. For the nl bin descendants, we find that ~ 11% of 
gala xies fall in the nl bin at z= 3, as discussed in Sec- 
5 tion |4.4[ and the remaining galaxies are distributed either 

_ 4 a at higher number densities (~ 67%), or at lower number 
'5' densities (~ 22%) than the original number density bin. 
The overall scatter in stellar mass at z =3 is ±0.32 dex, 
which corresponds to a number density scatter which has 
a total width of ~ 1.2 dex, given the z = 3 CSMF. For 
progenitors of the nl bin, we find that ~ 13% fall within 
the target z = 7 number density bin with ~ 67% at 
higher number densities, and ~ 21% at lower number 
densities. This distribution leads to a stellar mass scatter 
of ±0.75 dex at z= 3, and a number density scatter with 
total width of ~ 1.1 dex. It should be not ed th at the 
“late-forming” galaxies discussed in Section |4.2| are in¬ 
cluded in the fraction of galaxies which fall above the nl 
bin. However, they are removed from the sample at their 
formation redshift. Therefore, they do not contribute to 
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the median scatter (gray shaded area in Figure [7]) at 
z =3. The n2 and n3 density bins have quantitatively 
similar results. 

It is interesting to consider the growth histories of 
galaxies which started in the selected number density 
bin at z=7, and end up outside the bin at z= 3. We 
performed a test to determine the average growth rates 
of these populations of galaxies, calculating the average 
stellar mass at each time step of galaxies outside the bin 
(separately for those with both higher and lower num¬ 
ber densities). We find that these average growth his¬ 
tories show that these galaxies initially grow too quickly 
or too slowly (moving to lower or higher number densi¬ 
ties, respectively), jumping out of the selected number 
density bin, but after that, they grow at a relatively con¬ 
stant rate, similar to those galaxies inside the selected 
bin. Therefore, there is no systematic difference in the 
growth histories of these galaxies; rather, after they ini¬ 
tially scatter out of the number density bin, they grow 
fairly similarly to the tracked population. 

The fact that both the descendants and the progeni¬ 
tors have similar fractions of galaxies located in, above 
and below the original bin is misleading. As mentioned in 
Section |T4} the slope of the CSMF is steepening with de¬ 
creasing redshift, resulting in a narrower number density 
bin at 2=3 and a lower descendant completion fraction. 
This must also be considered when looking at progenitors 
as the bin is widening as we approach z=7 , thus making 
it more likely that a galaxy will fall within the selection 
bin (i.e. higher completion fraction). 

As demonstrated by the mass and number density his¬ 
tograms included in Figure [7J even using an evolving cu¬ 
mulative number density selection results in a wide scat¬ 
ter in both number density and mass for progenitors. We 
conclude that using an evolving cumulative number den¬ 
sity will allow for the tracking of average stellar mass for 
progenitors, but that this method is an insufficient tracer 
of individual galaxy descendant/progenitor evolution in 
our simulations, as nearly any bin on the z =3 CSMF can 
be populated with galaxies that originated from nearly 
any point on the z=7 CSMF. We show furth er evidence 
of this scatter and its impact in Section |5.1[ 



log Ms [M 0 ] 


Fig. 8. — The mass distribution of z = 7 simulated galaxies, 
which were tracked as progenitors of 2 : = 3 galaxies, which were 
themselves tracked as descendants of z = 7 galaxies in the nl num¬ 
ber density bin (effectively a “round-trip” tracking exercise). Here 
we show the stellar mass distribution for all galaxies selected in a 
mass bin at 2 = 3 which cont ained 50% of the originally-tracked 
z — 7 descendants (see Section |4.4| for details). The gray shading 
indicates the original nl bin width and the red bar represents the 
late -form ing population found during progenitor tracking (see Sec¬ 
tion |T2|. This “round trip” tracking exercise demonstrates that a 
given point on the z=3 CSMF can be populated by galaxies which 
have origins at nearly any point on the z=7 CSMF. 

bution of the 2175 2=3 progenitors, and the gray shaded 
region indicates the mass width of the original nl number 
density bin. This exercise effectively demonstrates that 
even a moderate widening of the target mass bin will 
lead to substantial contamination from a diverse popu¬ 
lation of galaxies. This exercise also shows that a large 
scatter makes the application of either the constant cu¬ 
mulative number density or evolving cumulative number 
density technique limited to average population masses 
only. The diverse mass assembly histories of individual 
galaxies can make associating other properties such as 
age, metallicity, star-format ion rate, etc., untenable. 


5. DISCUSSION 
5.1. Scatter 

To further explore the implications of the large scatter 
in descendants, an exercise was conducted to determine 
the mass bin width required at z=3 to obtain a com¬ 
pleteness fraction, as defined in this work, of 50%. It 
was found that an increase of a factor of ~ 3 from ±0.10 
dex in stellar mass to ±0.29 dex in stellar mass at z =3 
is required to recover 50% of the original z=7 selected 
population (see Table [2]). While a mass bin with a width 
of ±0.29 dex may not seem extreme, as it is comparable 
to the observed mass error estimates at 2 = 3, it will 
contain thousands of galaxies that were not part of the 
original 2=7 density bin, which may have very different 
mass assembly histories. This can be see in Figure [8] 
where the stellar mass distribution for a sample of 2175 
galaxies were selected from the aforementioned 50% bin. 
The progenitors of this sample were then tracked back to 
z=7 for comparison to the original z=7 population. The 
blue histogram represents the 2 = 7 stellar mass distri- 


5.2. Late-forming galaxies 

The ~ 30% of 2=3 progenitor galaxies found in our 
simulation to form after 2=7 have a significant impact 
on the results of the progenitor portion of this study. 
In particular, the scatter of the stellar mass and num¬ 
ber density evolution is increased by a factor of ~ 2. 
The high fraction, and significant impact of the “late- 
forming” population dictate that further discussion is 
warranted. Specifically, there is the possibility that these 
galaxies are artifacts of our simulations due to resolution 
limitations. The large volume of our simulation box is 
chosen so that we can best represent the dynamic range 
of observed high redshift galaxies. The consequence of 
this design is that we are not resolving low-mass halos 
with Mf, < 10 9 Mq at 2 >7. Therefore, the possibility 
exists that a theoretical simulation box with unlimited 
mass resolution would have resolved these “late-forming” 
galaxies at an earlier redshift. 

In a ACDM Universe with a hierarchical structure for¬ 
mation scenario, it is possible that these galaxies would 
have then been merged into a higher mass galaxy by 
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z= 7, thus reducing the scatter. It is also possible that 
no mergers took place which would leave the galaxy in 
a lower mass bin (i.e. much higher on the CSMF). In 
this scenario the scatter would remain largely unchanged. 
To properly study this issue, a resolution test would be 
needed, which requires a simulation with a substantially 
higher particle count. Unfortunately, such a hydrody¬ 
namic simulation is beyond the scope and resource of 
this study. 

There is also the possibility that the “late-forming” 
galaxies are very “real”, and representative of early 
galaxy formation. A simple calculation, exploring the 
growth rate required for a galaxy at z =5 with M s ss 
1O 71 M 0 to reach the target stellar mass of M s ss 
10 10 ' 4 M q by z= 3, demonstrates that such a galaxy would 
require a star formation rate (SFR) of ~ 25 Mq yr -1 
over the ~ 1 Gyr. This SFR is reasonable, given that 
a typical observed L* galaxy has an SFR« 50 M 0 yr~ x 
at z =5 ? . This is without consideration for mergers, 
which could increase the stellar mass much more rapidly. 
Therefore, there is no physical motivation for us to defini¬ 
tively conclude that the “late-forming” galaxies are mere 
resolution artifacts. 

We investigated the photometric properties of the pu¬ 
tative “late-forming” galaxies to determine if it would be 
possible to detect z =3 observational signatures. More 
specifically, we examined the D4000 spectral indicator, 
which measures the relative strength of filters placed 
around the 4000A sp ectra l features ass ociated with Cal¬ 
cium H and K lines (Bruzual A. 1983). The strength of 
this break is correlated with age, as younger, high SFR 
galaxies will produce an increase of UV photons at wave¬ 
lengths less than 4000A. The spectra of these galaxies 
will also contain less observable metal-line absorption, 
which will result in _a decrease d D4000 ratio. Utilizing 
the BC03 (Bruzual & Chariot 2003) population synthe¬ 
sis models to generate spectra tor each of our simulated 
galaxies, we found that there is no significant difference 
in the D4000 break for “late-forming” galaxies as com¬ 
pared to the remaining studied population. It is likely 
that this is the result of a combination of merger events 
and rapid star formation that leads to the effective mix¬ 
ing of older stellar populations with the newly formed 
populations making the 4000A an unreliable age indica¬ 
tor for our galaxy population. 


5.3. On the origin of the evolving number density 

When interpreting the results of this work, and other 
similar studies, it is critical to remember that number 
density is not a physical property of a galaxy (i.e. un¬ 
like stellar mass, SFR, metallicity, etc.). The number 
density of any individual galaxy is dependent directly on 
the distribution function from which it is taken, in this 
case the cumulative stellar mass function. As seen in 
Figure [l] the CSMF is evolving with decreasing redshift 
towards more galaxies at higher masses, but also toward 
a stronger high-mass exponential cut-off. There are two 
primary forces driving this evolution of the CSMF over 
the studied redshift range: First, the stellar masses of ex¬ 
isting galaxies are growing, which evolves the CSMF to 
toward higher masses. Second, new galaxies are forming 
and being included which, in a “bottom-up” hierarchical 
structure formation universe, will place them at higher 


number densities, effectively acting to steepen the expo¬ 
nential cutoff of the CSMF. We caution that this steep¬ 
ening is not related to the faint-end slope of the stellar 
mass function as our dynamic range does not include 
faint-end mass galaxies. 

The steepening of the CSMF cutoff directly results in 
the evolution of the cumulative number density in our 
simulations. This can be clearly demonstrated by exam¬ 
ining the rise in cumulative number density for a given 
stellar mass bin as the exponential cutoff becomes more 
pronounced (i.e. towards vertical). Therefore, a popula¬ 
tion of galaxies can have smoothly rising mass assembly 
histories, as seen in Figure [21 with no break in rank or¬ 
der, and still exhibit an evolving number density. This 
result is contradictory to the fundamental assumptions 
that are made in the constant cumulative number den¬ 
sity technique, whic h has been shown to work at lower 
redsh ift (0 < z < 3; van Dokkum et al.||2010 Leja et al.| 
20131. We do not however feel that our results invali- 
date the lower redshift results as the processes driving 
galaxy evolution are different during that more recent 
epoch (i.e. driven by stellar mass growth and not new 
galaxy formation, with an increased impact of feedback 
on massive galaxies). 

Based on the above evolution argument, one would ex¬ 
pect that the progenitor number density evolution would 
then trend towards lower number densities at higher red- 
shifts. We do not see this trend. In fact we find the oppo¬ 
site, with progenitors evolving to higher number densities 
(Figure [TJ bottom). This can be explained by the impact 
of the large “late-forming” galaxy population acting to 
lower the median stellar mass growth (Figure [3]), thus 
raising the cumulative number density. 

It must be noted that the steepening of the CSMF 
in our simulations could also be the result of our mass 
resolution limitations as discussed in Section 15.21 Low 
mass galaxies which are unresolved would fall at higher 
number densities, and could therefore steepen our z = 7 
CSMF, potentially eliminating the evolution seen in this 
work. Cosmological volume hydrodynamic simulations 
with sufficient dynamic range to test this effect are be¬ 
yond the scope of this work based on the computational 
resources required. 

5.4. Implications of scatter for abundance matching 


The agreement with the Behroozi et al. (2013a I re¬ 
sults is rather remarkable given the differing a pproaches 
taken to arriv e at these conclusions. In the IBehroozTl 
et al. (2013a) work an abundance matching technique 
is employed. This technique converts the cumulative 
number density corresponding to a galaxy’s stellar mass 
at a given initial redshift to a dark matter halo mass 
with the same cumulative number density from the dark 
matter-only Bolshoi simulation halo catalog. The pro¬ 
genitors/descendants of these halos are then tracked 
within simula tion us in g a si milar ‘most-massive’ algo¬ 
rithm (Behroozi et al. 2013b). At the terminal redshift 
the mass of each halo is then converted back to a cumu¬ 
lative number densit y usin g the dark matte r halo cumu¬ 
lative mass function (Behroozi et al. |2013c|. 

At the core of the abundance matching technique is 
the fundamental assumption that halo mass accretion, 
whether it be through mergers or via the cosmic web, 
directly traces the galaxy stellar mass growth hosted 

























10 


Jaacks et al. 


within. This is a reasonable assumption given that we 
know that the baryonic matter in the Universe dir ectly 


traces the underlying dark matter distribution (Blanton 


et al. 1999). It does not however account for the com¬ 


plicated physics of how this gas accretes onto galaxies 
and ultimately is converted into stars. This absence of 
fundamental physics is the primary motivation for this 
work. 

The utilization of cosmological SPH simulations is the 
next logical step in testing these galaxy-tracking tech¬ 
niques, as they include both the underlying dark mat¬ 
ter structure and the complicated hydrodynamic physics 
associated with the gas within (i.e. heating, cooling, 
chemistry). These simulations are also able to convert 
gas into stars via physically motivated sub-grid physics 
as well as account for the impact of star formation (i.e. 
supernova and stellar winds) on further star formation. 
This allows for galaxies to form in-situ with their respec¬ 
tive dark matter halos, while computing the gas infall 
rate from cosmic filaments in a cosmological setting cor¬ 
rectly.. Therefore we are able to directly track galaxy 
stellar mass growth without the need to make any as¬ 
sumptions regarding the relationship between the galaxy 
and the halo it resides within. The drawback to uti¬ 
lizing these types of high resolution SPH simulations is 
that they are computationally very expensive. This lim¬ 
its the dynamic mass range and redshift range we are 
able to study when compared to dark matter only sim¬ 
ulations (i.e. Bolshoi, Millennium), which resolve lower 
mass structures. To probe this topic further, a combina¬ 
tion of theoretical techniques is likely warranted. 


6. SUMMARY 

Using the cosmological smoothed particle hydrody- 
namical code GADGET-3 we make a realistic assessment 
of the validity of the technique of using constant cumu¬ 
lative number density as a tracer of galaxy evolution at 
3 < z < 7. Our major conclusions are as follows: 


• We find that, when utilizing the constant cumula¬ 
tive number density technique to track galaxies at 
3 < z < 7, an average stellar mass can be deter¬ 
mined to within a factor of < 2 for descendants and 
~ 2 for progenitors (Figure [2] & [3]). 


• The systematic offset between the inferred mass 
growth and the actual mass growth suggest that 
the number density of both progenitors and descen¬ 
dants is evolving with redshift. This evolving cu- 
mula tive number density is in very good agreement 
with Behroozi et al. (2013a) who find a similar evo¬ 
lution over the same redshift range (Figure [4j. 


• On average, the cumulative number density for the 
stellar mass ranges studied in this work (10 8 ’ 48 < 
M s /Mq < 10 9 - 55 ) can be found using, 

logn/ = logn* — O.lOAz (3) 

for descendants, and, 


logn/ = logn* + 0.12Az (4) 


for progenitors. 


• While the evolving cumulative number density is 
able to track the on average evolution of galaxies 
at 3 < 2 < 7, we find that there is a large scat¬ 
ter of ~ ±0.30 dex for descendants at 2 = 3, and 
~ ±0.70 for progenitors at z =7 (Figures [2j [3j |6]& 
[ 7 ]). Individual galaxies demonstrate diverse mass 
assembly histories due to the physical processes of 
gas accretion, quenching, mergers and star forma¬ 
tion. 


• When evaluating the completeness fraction, we find 
that only ~ 23% of descendants fall within the orig¬ 
inal number density bin by z = 3 (Figure [5]). This 
low fraction is due to the narrowing of the target 
mass bin which is caused by the steepening of the 
CSMF. 

• Increasing the number density bin width at z =3 to 
improve the completeness fraction of z =7 descen¬ 
dants to 50% leads the contamination of the orig¬ 
inal sample by thousands of galaxies which have 
origins outside of the targeted number density bin. 
Thus, significantly reducing the reliability of the 
evolving cumulative number density method. 


The technique of utilizing an evolving cut of cumula¬ 
tive number density is a powerful tool in tracking the 
average mass growth of galaxies at 3 < z < 7. This work 
demonstrates that galaxies in a given number density 
bin will have very diverse mass assembly histories even if 
they are of similar stellar mass. This diversity will reveal 
itself in the scatter of other physical properties of indi¬ 
vidual galaxies (i.e. metallicity, age, SFR, star formation 
history), thus caution should be used when applying this 
technique to study physical properties other than stellar 
mass. 
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